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Abstract — Ananda Mohan suggested that the first New Chinese 
Remainder Theorem introduced by Wang can be derived from 
the constructive proof of the well-known Chinese Remainder 
Theorem (CRT) and claimed that Wang's approach is the same 
as the one proposed earlier by Huang. Ananda Mohan's proof is 
however erroneous and we show here that Wang's New CRT I 
is a rewriting of an algorithm previously sketched by Hitz and 
Kaltofen. 



I. Introduction 

A Residue Number System (RNS) is defined by a set of 
n pairwise relatively prime numbers Pi, . . . , Pn G N. Let 
M — n"=i Pi- Then, any integer X belonging to Z/AfZ — 
{0, . . . , Af — 1} has a unique RNS representation given by 



(xi = \X\p^ , . . . ,Xn = |X|p^)rns, where \X\p^ denotes 
X mod Pi. The major advantage of the RNS, which explains 
its popularity in digital signal processing, is that addition, 
subtraction, and multiplication on large integers X and Y 
G Z/A/Z are replaced by n modular operations performed 
in parallel, and whose operands are bounded by the moduli 
P,: 



\XoY\ 



M 



oyi 



Pi 



^ Vn \p )■ 



where o denotes addition, subtraction, or multiplication. How- 
ever, an RNS is not a positional number system, thus making 
conversion to integer, comparison, and division difficult to 
perform. The constructive proof of the Chinese Remainder 
Theorem (CRT) provides an algorithm to design a residue-to- 
binary converter: 



X 



4=1 
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where Si = M/Pi and |l/si|p. is the multiplicative inverse 
of Si modulo Pi. The main drawback of this approach is 
that it requires multiplication by the s/s, which are large 
numbers, and modulo M operations. The Mixed Radix System 
(MRS) associated with each RNS offers another conversion 
scheme. A number X G Z/A/Z is represented by an n-tuple 
{x[,x'2, . . . ,x',^)mrs such that X = x'l+ccjPi +2:3^1 P2 + - • • + 
x'^PiP2 . . . P„_i, and x'^ < P^, Vi G {1, . . . , n]. The Mixed 
Radix Conversion (MRC) is however a strictly sequential 
process. 
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Wang [1] proposed a novel conversion algorithm, called 
New Chinese Remainder Theorem I (New CRT I), based on 
a new set of constants hi which are smaller than the s^'s 
used in the original CRT (Equation ([T]i)- He claimed that the 
New CRT I is "substantially different from the MRC and the 
CRT approaches". A few months later, Ananda Mohan [2] 
suggested that the New CRT I is not different from the 
approach introduced by Huang [3]. His proof is unfortunately 
erroneous and this paper aims at correcting it: we show that 
the New CRT I is a rewriting of an algorithm sketched by Hitz 
and Kaltofen [4]. 

II. Huang's Algorithm 

Huang proposes a way to compute Equation ^ without 
modulo M operations [3]. He defines 



Xi = Si 



Pi 



By noting that 



if i = j, 
otherwise. 



and 



Xi = (xi,0,0,...,0,0)rns, 
X2 - (0,a;2,0,...,0,0)RNs, 



Xri 



(0,0,0,. ..,0,x„) 



RNS, 



Huang suggests to compute the MRS representations of the 
Xi's by means of tables. Let x'^ ^ denote the jth MRS digit 
of Xi. It is worth noticing that Xi j — when j < i: 



Xi — {Xii,Xi2t^1,^t 



■ 1 -^1,11-1 



^l,Tl) 



MRS, 



Xo 



Xri 



(0, ^2 2, 2:2 3, . . . , X2,n-1, X2,n) 



MRS, 



(0,0,0,...,0,x' ) 



MRS- 



Szabo and Tanaka's algorithm [5] allows one to determine 
all the MRS digits of Xi from Xi. Since x[ i — xi [5], this 
conversion step requires n{n + l)/2 — 1 tables. The addition 
of the Xi's is then performed in the MRS. Recall that the sum 
of two digits of weight Pi ... Pi generates a carry v if it can 
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be written as u + uPi+i, with u < P^+i. At the end of this 
addition process, Huang gets: 



X 



x' Pi . 



■Pn-1 
■ Pn-1, 



qM\ 
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where q G N is the output carry. Since arithmetic is performed 
modulo M, qM vanishes and Huang obtains eventually the 
MRS digits of X. 



Example 1 Let Pi 

representation of X 
are defined by: 

{8,0,0)rns 
{0,7,0)rns 
(0,0,4)r^s 



11, P2 = 13, and P3 = 17. The RNS 
514 is then X = (8, 7, 4)kvs- The Xi's 



= (8,4,12)m«s = 1768, 
= (0, 3, 5)m«s - 748, 
= (0,0,3)m«s = 429. 



Let us compute the sum of the Xi 's in MRS. By propagating 
the carries, we eventually obtain the MRS digits of X: 

X = |8 + 7-Pi +20-FiF2|m 

= |8 + 7-Pi + (3 + F3)-PiP2|m 

= |8 + 7-Pi+3-PiP2 + PiP2P3|m 

= 8 + 7-Fi+3-PiF2 

= (8,7,3)mrs = 514. 

III. HiTZ AND KALTOFEN'S REMARK 

The main drawback of Huang's method lies in the n{n + 
l)/2 — 1 tables involved in the conversion of the Xi's 
from RNS to MRS. Hitz and Kaltofen [4] suggest to carry 
out the products |xi |l/si|p. |^ by means on n modular 
multipliers, where the constants jl/s^jp are precomputed. 
Then, they look at the MRS representation of the constants 
Si = . . . , Si, n)MRS, and evaluate Equation ([T]) in MRS. 
Since Si is a product of modulus, we know that Sij = if 
j < i. Furthermore, the MRS representation of s„ ~ M/Pn 
is (0, . . . , 0, 1)mrs- Thus, n{n + l)/2 — 1 multiplications are 
required in this step. Then, it suffices to compute the sum 



of the Si 



|l/s. 



' IPi I p 



in MRS in order to get the MRS 



digits of X. Hitz and Kaltofen describe an architecture which 
efficiently deals with the carries to perform this task in [4]. 

Example 2 (Example [T] continued) Let us apply Hitz and 
Kaltofen's approach to convert X — (8, 7,4:)rns- First of all, 
we compute the products \xi |l/si|p. \p and obtain: 



3. 





1 




= 8, 




1 








1 




Xl 












= 4, 










Si 


Pi 


Pi 




32 


P'l 


P2 




S3 


P3 



The MRS representations of the Si 's are given by: 



Sl 
S2 

S3 



(1, 7, 1)mrs 
(0,4, 1)m«s 
(0,0,1)m«s 



221, 
187, 
143. 



Thus, 
X 



8 • (1, 7, 1)mrs + 4 ■ (0, 4, 1)mrs + 3 • (0, 0, 1)mrs 
+ 72-Pi + 15-PiP2|^, 
+ (7 + 5-P2)-Pi + 15-PiP2U, 
+ 7-Pi + 20-PiP2Uf 
+ 7-Pi + (3 + P3)-PiP2|m 
+ 7-Pi+3-PiP2 + PiP2P3|m 
, 7, 3)mrs = 514. 



Hitz and Kaltofen also point out that the multiplications by 
the Xi's could be saved thanks to the second form of the CRT: 
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Since 
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1 if j, 
otherwise. 



we obtain another formula to compute X from the Xi's: 

X = |xi • (1,0,0,..., 0,0)rns + 

X2-(0,1,0,...,0,0)rns + ...+ 
Xn ■ (0,0,0, ... ,0, 1)rns|m- 

The disavantage of this approach is that the Si|l/si|p 's 
are larger than the Si's, thus leading to larger carries [4]. 
Therefore, Hitz and Kaltofen did not further investigate this 
solution. We will prove in the next section that this algorithm 
turns out to be Wang's New CRT I. It is worth noticing that 
Huang stores the MRS digits of all numbers belonging to 
Z/A/Z, whereas Hitz and Kaltofen only need the MRS digits 
of the Si's or Si |l/si|p,'s (n{n + l)/2 numbers). 

Example 3 (Example [T] continued) Let us now convert X = 
(8,7,4)ra's according to the second form of the CRT given 
by Equation (O. Szabo and Tanaka's algorithm allow us to 
compute the MRS digits of the constants si |l/si|p .• 



51 |l/si|p^ 

52 Il/S2|p2 
S3. |1/s3|p3 



(1,7,1W-221, 
(0,6,10)m«s= 1496, 
(0,0,5)m«s = 715. 



Thus, we have 
X 



8 ■ (1, 7, 1)mrs + 7 ■ (O, 6, 10)m«s + 4 • (0, 0, 5)m«5 
+ 98-Pi +98-PiP2|^f 
+ (7 + 7-P2)-Pi+98-PiP2|^, 
+ 7-Pi + 105-PiP2Uf 
+ 7-Pi + (3 + 6-P3)-PiP2U 
+ 7 • Pi + 3 • P1P2 + 6 • P1P2P3IM 
(8, 7, i)MRs = 514. 
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IV. Wang's New CRT I Revisited 
Wang's New CRT I is based on the following identity [1]: 



X = \xi + ki{x2 ~ xi)Pi + k2{x:!. - X2)PiP2 

)Pi . . . P„-i|m, 



where 



(3) 



(4) 



with 1 < i < n — 1. According to Wang, the New CRT I is a 
fast conversion algorithm substantially different from the CRT 
approach. Let us prove that Equation (O is a rewriting of the 
CRT defined by Equation (|2|. Consider three pairwise prime 
integers a, h, and c. The following property hold^: 
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(5) 
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If a — P2 ■ ■ ■ Pn, b — Pi, and c = 1, we have: 

= l-Pi 



Sl 



P2■■■P,^ 



|l-fclA|p,...P„ 



Assume now that 2<i<n — 1, a = Pi+i 
and c — Pi . . . Pi=i. We obtain: 



. Pn, b = P„ 
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p... p 



i-1 



= \ki-i 
Eventually, we note that: 



ki Pi\p^_ 



■P^ 
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Pl...Pn~l 



Pi... p„-l 

= kn-1 ■ Pi ■ . . Pn-1- 

Starting from Wang's New CRT I, we have: 

X = \xi + ki{x2 - Xi)Pi + k2{x3 ~ X2)PlP2 

)Pi . . . P,i-i|j\/ 

= |(l-fciPi)a;i + (fci-fc2P2)a;2Pi + ...+ 

(fc„_2 - kn-lPn-l)Xn-lPl ■ ■ ■ Pn-2 + 
kn-lPl ■ ■ ■ -Pn-lUf 

= I |1 - fciFil^^xi + \ki - k2P2\p^,,,p^ X2P1 + 

\kn~2 ^ 1-^" — 1 lp„_iP„ Xn—lPl ■ ■ ■ Pn — 2 

kn~lXnPl ■ ■ ■ Pn-l\M 
1 



M 



which is the second form of the CRT defined by Equation (|2|i. 
Note that the ki's are the MRS digits of numbers congruent 

'in order to prove this equality, it suffices to consider the solutions of the 
Diophantine equation ax + by = |l/c|^i,- See for instance [6] for details. 
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to the Si 's modulo M. We have: 

),■•■, 0)mrs (mod M), 
(0,...,0,fc,_i,-fci,0, ...,0)mrs (mod M), 
0, fc„_i)MRs (mod M), 

where 2 < « < n — 1. Therefore, Ananda Mohan is wrong 
when he writes in [2] that Wang suggests to use the MRS 
representation of the Xi's, which is the technique described 
by Huang. We give two examples to illustrate that Wang 
rediscovered the second form of the CRT (Equation (|2]i) and 
that the explicit computation of the fc^'s is useless. 

Example 4 (Example [T] continued) Let us convert now X — 
(8, 7,4)^jvs according to Equation 0. We find that ki = 201 
and k2 = 5. We easily check that: 



Sl |l/si|pj = (1,7, 1)mrs 

S2|l/S2|p, = (0,6,10)MfiS 

S3 |l/s3|p, = (0,0, 5)m«s- 



(1,-201,0W (modM), 
(0,201,-5)Mfis (modM), 



In order to avoid a multiplication by 201, Wang recommends 
to work with the following set of constants [1 ]: 



|1 - fclfl|pjP2...p„ 

\ki — fci+iPi+i|p.^ 

I^n— 1 Ip^^ 



if 1 = 0, 
P„ ifl<i<n-2, 
if i = n ~ 1. 



He suggests to compute the MRS digits of ag and YYj=i ^i' 
\ < i < n — 1. From the previous results, it is obvious that 
these numbers are nothing but the Si\l/ Si\p,'s. Therefore, 
Wang performs the following operations: 

1) computation of the ki's, i.e. the MRS digits of numbers 
congruent to the Si |l/si|p 's modulo M; 

2) computation of the Si \ \/si\p, 's from the ki's; 

3) computation of the MRS digits of the Si |l/si|p, 's ac- 
cording to Szabo and Tanaka's algorithm. 

Since the Si |l/sj|p 's only depend on the moduli set, it not 
necessary to compute the ki 's. 

Example 5 Wang et al. [7] proposed a converter for the RNS 
defined by Pi = 2", P2 = 2" + 1, and P3 = 2" - 1. They ex- 
plained that the ki 's of the New CRT I allowed them to achieve 
better performance in terms of speed and area. Let us show 
that the same result can be obtained without computing the 
ki's. Szabo and Tanaka's conversion algorithm [5] provides 
us with the MRS digits of the Si \ l/si\p. 's: 



(0,1,0)r^s 
(0,0,1)r^vs 



(1, 1, 2" — 2)mrs, 

(0,2",2"-i-l)Mfi5, 

(0,0,2"-1)mrs. 



In the following, we keep the somewhat confusing notation 
used by Wang et al. [7]: xi = \^\p.^' ^2 — \X\p-^^> and 2:3 = 
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\X\p^. We have: 

X = |a;2 • (l,0,0)«vs + a;3 • (0,l,0)/ws + 

xi ■ (0,0, 1)rns\m 
= \X2 + {x^ + X3 • 2") • 2" + (xi • 2"-i + 

X2 • (2" - 2) + X3 • (2"-i - 1)) • 2" . (2" + 1)|m 
= \X2 + {x^ + X3 • (2" + 1) - X3) • 2" + (xi • 2"-i + 

X2 ■ (-2") + X3 ■ (2"-i - 1)) ■ 2" ■ (2" + 1)|m 
= a;2 + 2"-|(a;2-a;3) + 

(a;i-2a;2+a;3)-2"-i-(2" + l)|22n_i, 

w/i/c/z is the formula obtained by Wang et al. (see Equation (7) 
in [7]). 

V. Conclusion 

We proved that the New CRT I was solely based on the 
original CRT, of which it was only a mere rewriting, and 
that Wang rediscovered an algorithm sketched by Hitz and 
Kaltofen in [4]. We also explained why the comment on the 
New CRT I by Ananda Mohan [2] is erroneous. 
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